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Summary 

The bat coronavirus HKU4 belongs to the same 2c lineage as that of the deadly Middle East 
Respiratory Syndrome coronavirus (MERS-CoV) and shows high sequence similarity, therefore 
potentiating a threat to the human population through a zoonotic shift or “spill over” event. To date, 
there are no effective vaccines or antiviral treatments available that are capable of limiting the 
pathogenesis of any human coronaviral infection. An attractive target for the development of anti- 
coronaviral therapeutics is the 3C-like protease (3CL pro ), which is essential for the progression of the 
coronaviral life cycle. Herein, we report the screening results of a small, 230-member peptidomimetic 
library against HKU4-CoV 3CL pro and the identification of 43 peptidomimetic compounds showing 
good to excellent inhibitory potency of HKU4-CoV 3CL pro with IC 50 values ranging from low micromolar 
to sub-micromolar. We established structure-activity relationships (SARs) describing the important 
ligand-based features required for potent HKU4-CoV 3CL pro inhibition and identified a seemingly 
favored peptidic backbone for HKU4-CoV 3CL pr0 inhibition. To investigate this, a molecular sub- 
structural analysis of the most potent HKU4-CoV 3CL pr0 inhibitor was accomplished by the synthesis 
and testing of the lead peptidomimetic inhibitor’s sub-structural components, confirming the activity of 
the favored backbone (22A) identified via SAR analysis. In order to elucidate the structural reasons 
for such potent HKU4-CoV 3CL prp inhibition by the peptidomimetics having the 22A backbone, we 
determined the X-ray structures of HKU4-CoV 3CL pro in complex with three peptidomimetic inhibitors. 
Sequence alignment of HKU4-CoV 3CL prp , and two other lineage C Betacoronaviruses 3CL pro ’s, 
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HKU5-CoV and MERS-CoV 3CL pr0 , show that the active site residues of HKU4-CoV 3CL pr0 that 
participate in inhibitor binding are conserved in HKU5-CoV and MERS-CoV 3CL pr0 . Furthermore, we 
assayed our most potent HKU4-CoV 3CL prp inhibitor for inhibition of HKU5-CoV 3CL prp and found it to 
have sub-micromolar inhibitory activity (IC 50 = 0.54 ± 0.03 pM). The X-ray structures and SAR 
analysis reveal critical insights into the structure and inhibition of HKU4-CoV 3CL pro , providing 
fundamental knowledge that may be exploited in the development of anti-coronaviral therapeutics for 
coronaviruses emerging from zoonotic reservoirs. 

1. Introduction 

Zoonotic viruses, including coronaviruses, Ebola virus, HIV, influenza A viruses, hantaviruses 
or henipaviruses, are ubiquitous and can emerge as significant human pathogens capable of global 
epidemics and pandemics. Most often, we spend our time and resources in targeting only the human 
pathogens with small molecule inhibitors with the ultimate goal of developing therapeutic compounds 
to treat the associated diseases. A missed opportunity, however, is targeting the zoonotic host with 
small molecule inhibitors with the goal of utilizing the acquired chemical and structural knowledge to 
inform us of the evolutionary path of the virus on a biological basis. So, we believe that by using 
small molecule compounds as probes of the structural evolution of viral enzyme drug targets, from 
the zoonotic reservoirs to the human pathogens, we can gain new insights and predict a priori the 
structural scaffolds of small molecule compounds that can serve as lead templates for therapeutic 
development against emerging human pathogenic viruses such as coronaviruses. 

Coronaviruses (CoVs) are enveloped, single-stranded, positive-sense RNA viruses that infect 
and cause disease in a variety of species including bats, birds, cats, dogs, pigs, mice, horses, 
whales, and humans. 1 ' 3 Coronaviral infections may range from mild to severe and can result in 
respiratory, enteric, hepatic, or neurological diseases in their carriers. The first two human CoV 
strains (HCoV-229E and HCoV-OC43) were identified in the mid-1960s, and it wasn't until the 21 st 
century that a new human coronavirus (Severe Acute Respiratory Syndrome or SARS-CoV) was 
identified. 4,5 Currently, there are at least six known human CoVs including: HCoV-229E, HCoV-NL63, 
HCoV-OC43, HCoV-HKUl, SARS-CoV, and most recently, the Middle East respiratory syndrome 
coronavirus (MERS-CoV). 6 MERS-CoV, formerly known as HCoV-EMC, was identified in November 
2012, when it was isolated from the sputum a 60-year-old Saudi Arabian man presenting acute 
pneumonia and renal failure. 7 Since the time of its identification, the virus has grown to be a threat to 
public health worldwide having a case-fatality rate of about 30%. 8 To date, there are no vaccines or 
antiviral agents capable of preventing or treating any human coronaviral infection. 





ACCEPTED MANUSCRIPT 


3 

74 MERS-CoV belongs to lineage C in the genus Betacoronavirus of the Coronaviridae family in 

75 the Nidovirales order. 9 Also of this lineage are the species Tylonycteris bat coronavirus HKU4 

76 (HKU4-CoV) and Pipistrellus bat coronavirus HKU5 (HKU5-CoV), where the overall amino acid 

77 sequence identities of MERS-CoV to HKU4-CoV and HKU5-CoV across the conserved domains are 

78 approximately 75% and 76.7%, respectively. 6,10,11 Though the exact origin of MERS-CoV is currently 

79 debated, a bat origin is strongly suspected as MERS-CoV is so closely related to HKU4- and HKU5- 

80 CoV and because MERS-CoV genomic RNA has been found in bats and dromedary camels in 

81 Qatar. 12,13 Though HKU4-CoV and HKU5-CoV have been found only in bats, studies have shown that 

82 their accessory proteins are capable of inhibiting human antiviral signaling pathways in vitro.' 4 ' 5 This, 

83 and the close similarity of MERS-CoV to HKU4-CoV and HKU5-CoV, suggests that a zoonotic shift 

84 from bats or camels to humans may have occurred. 15 A recent investigation into the interactions 

85 between the human CD26 receptor and the receptor binding domains (RBDs) in the MERS-CoV, 

86 HKU4-CoV and HKU5-CoV envelope-embedded spike protein revealed that MERS-CoV and HKU4- 

87 CoV both engage this receptor for viral entry whereas HKU5-CoV does not. 16 These observations 

88 suggest that an evolutionary pathway from bat HKU4-CoV to human MERS-CoV exists and that 

89 investigating the molecular basis of this zoonotic shift from a structural and chemical-biology 

90 perspective may allow us to predict and target the these viruses with small molecule therapeutics. 

91 Coronaviral genomes are polycistronic, encoding for two large polyproteins, ppla and pplab. 17 

92 20 Initiation of coronavirus replication in cells occurs by the translation of two overlapping, open 

93 reading frames (ORFIa and ORFIb) to produce ppla and, following a -1 ribosomal frameshift 

94 mechanism, pplab. These polyproteins are then proteolytically processed at 14 cleavage sites by 

95 two essential viral cysteine proteases, the papain-like protease (PL pr0 , or nsp3) and the 3C-like 

96 protease (3CL pr0 , also known as the main protease, M pr0 , or nsp5). Cleavage by both proteases 

97 results in the production of 16 nonstructural proteins (nsps), where PL pro is responsible for cleavage at 

98 3 sites and 3CL pro is responsible for cleavage at 11 sites (Figure 1). The function of 3CL pr0 is vital for 

99 the coronaviral life cycle, making it an attractive target for the development of antiviral drugs. 21,22 

100 The present work was undertaken to investigate the kinetic and structural properties of HKU4- 

101 CoV 3CL pro and to utilize this knowledge to discover and develop potent inhibitors of HKU4-CoV 

102 3CL pr0 . Targeting the immediate zoonotic reservoirs of coronaviruses with small molecule inhibitors 

103 can help inform structure-based design strategies aimed at creating molecular scaffolds that may also 

104 target the emerging human CoVs and ultimately aid in the development of therapeutics against 

105 coronaviral infection. Towards this goal, we first expressed, purified, and characterized the kinetic 

106 properties of HKU4-CoV 3CL pro . We then determined the inhibition of HKU4-CoV 3CLpro by a small 

107 library of 230 peptidomimetic compounds, which resulted in the identification of 43 HKU4-CoV 
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3CLpro inhibitors, two of which showed sub-micromolar potency. With a potent lead inhibitor 
compound in hand, we investigated the contributions of its individual sub-structural components to 
inhibitory potency and identified a favored peptidic backbone for HKU4-CoV 3CL pr0 inhibition. Finally, 
we determined the X-ray crystal structures of the three most potent inhibitors containing this favored 
backbone in complex with HKU4-CoV 3CL pr ° and elucidated the structural reasons behind such potent 
HKU4-CoV 3CL pro inhibition. 

2. Results and Discussion 

2.1. Screen of peptidomimetic library for HKU4-CoV 3CL pro inhibition 

A library of 230 previously reported, peptidomimetic-type compounds 2324 was screened for 
inhibition of HKU4-CoV 3CL pr0 at a single concentration of 100 pM, and IC 50 values were determined 
for compounds that showed greater than 50% inhibition of HKU4-CoV 3CL pr0 at that concentration. 
This screen resulted in the identification of 43 peptidomimetic compounds with inhibitory 
concentrations ranging from micromolar to sub-micromolar (vide infra). The peptidomimetic inhibitors 
can be grouped into two classes, inhibitors with a single amide bond in the inhibitor backbone (Class 
A) and inhibitors with a dipeptide-like backbone (Class B). The results of this screen are displayed in 
Table 1 a (Class A) and Table 1 b (Class B), where only the compounds that showed greater than 50% 
inhibition of HKU4-CoV 3CL prp at 100 pM are shown (see Supporting Information SI and S2 for 
comprehensive list of compounds tested). The compounds are listed in the tables in rank order by 
IC 50 values starting with the lowest IC 50 value. Of the 25 Class A inhibitors that were tested for 
inhibition of HKU4-CoV 3CL pr0 , 19 were found to inhibit HKU4-CoV 3CL pro above 50% at a 
concentration of 100 pM, and subsequently, IC 50 values were determined. Remarkably, the two most 
potent inhibitors, 1A and 2A, proved to be sub-micromolar inhibitors of HKU4-CoV 3CL pro , having 
IC 50 ’s of 0.33 ± 0.02 and 0.41 ± 0.04 pM, respectively. Upon examination of all 27 Class A inhibitors, 
an immediate trend is apparent where there is an overwhelming preference for a 3-thiophene at the 
R 1 position and a 1-methylbenzotriazole at the R 3 position of the peptidomimetic backbone. This is 
observed in every Class A inhibitor found to produce inhibition of HKU4-CoV 3CL pr0 over 50% at 100 
pM, with the exception of the least active analogue in this series, 19A. 19A only differs from the more 
active compound 6A in that 19A has a triazole in place of a benzotriazole; however, this difference is 
significant enough to decrease inhibitory potency ten-fold between the analogues, from 1.6 ± 0.1 pM 
for 6A to 16.0 ± 4.0 pM for 19A. Interestingly, 19A maintains a triazole ring in the R 3 position, 
indicating that the position of the nitrogen atoms in this heterocycle is essential for good HKU4-CoV 
3CL pro inhibition. Other R 3 heterocycles incorporated in the less active members of Class A include 4- 
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141 substituted methyltriazoles, 4-imidazoles, and 2-furans, underscoring the importance of the 

142 methylbenzotriazole at the R 3 position and pointing to the potential significance of a precisely oriented 

143 hydrogen bond acceptor (see Supporting information SI for a comprehensive list). 

144 Of the 19 Class A peptidomimetics found to be good inhibitors of HKU4-CoV 3CL pro , 13 have 

145 amide substituents at R 2 , four have aromatics (5A, 7A, 9A, and 11 A), and two have amines (16A and 

146 17A). Among the inhibitors with amide substituents at R 2 , a trend can be observed where cyclic 

147 carboxamides are preferred (1A, 2A, 3A, 4A, and 8A) to alkyl carboxamides, with the aromatic 

148 carboxamides being the most active (1A and 2A). A preference for larger cyclic carboxamides is also 

149 observed, where 4, 5 or 6 membered cycles (1A, 2A, 3A, and 4A) are preferred over the smaller 

150 cyclopropylcarboxamide (8A). 

151 Among the six Class A inhibitors with aromatic groups substituted at the R 2 position (5A, 7A, 

152 9A, 11 A, 20A, 21 A), only the four compounds 5A, 7A, 9A and 11A were found to inhibit HKU4-CoV 

153 3CL pr0 above 50% at 100 pM and they each contain aromatic nitrogen heterocycles. The position of 

154 the nitrogen is of moderate significance, as a p-substituted pyridine at R 2 is 2-fold more active than 

155 the m-substituted pyridine (5A vs. 11 A, Table la). Bulky substituents on R 2 aromatic heterocycles, 

156 such as in 20A, decrease inhibition of HKU4-CoV 3CL pro , indicating a steric requirement within the R 2 

157 binding pocket (Table la and SI). Interestingly, 20A is one of only two Class A compound having a 

158 3-thiophene at the R, position and a 1-methylbenzotriazole at the R 3 position that is not a good 

159 inhibitor of HKU4-CoV 3CLpro at 100 pM. Addition of a methoxy substituent to the R 2 heterocycle is 

160 tolerated (7A and 9A, Table la), but does not increase or decrease inhibition significantly, indicating 

161 that the addition of another hydrogen-bond acceptor does not further engage the R 2 binding pocket 

162 and that smaller aromatic substituents can be tolerated. 

163 The two Class A peptidomimetics that have R 2 amines, 16A and 17A, have similar IC 50 values 

164 (4.8 ± 0.4 and 5.3 ± 0.6 pM, respectively); however, they have vastly different steric requirements 

165 (methyl vs. benzyl). As opposed to the R 2 amides and aromatics, this may indicate that when an 

166 amine substituent is present at the R 2 position, it dictates an alternative binding orientation within the 

167 R 2 binding space that does not lead to such strict steric requirements. Alternatively, the relatively 

168 hydrophobic benzyl group of 17A may decrease inhibitor solubility and consequent inhibitor 

169 availability, therefore resulting in lower % maximum inhibition and IC 50 value than would otherwise be 

170 observed. This may also be the case for 21A, where the R 2 aromatic ring lacks the nitrogen found in 

171 the more active inhibitors 5A and 11A (Table la and SI). 

172 A comparison of 17A to the second most active inhibitor in Class A, 2A, which is only different 

173 from 17A by the presence of a carbonyl, shows that the carbonyl of 2A is crucial for sub-micromolar 
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174 inhibitory potency (0.41 ± 0.04 pM for 2A vs. 5.3 ± 0.6 pM for 17A). This observed 10-fold increase 

175 in IC 50 may be a consequence of the loss of a stabilizing hydrogen-bond interaction between the 

176 amide -NH group or carbonyl oxygen of the inhibitor and the HKU4-CoV 3CL pr ° binding site. The 

177 change in hybridization between an amine and an amide also results in a change in the molecular 

178 geometry of the R 2 substituent, which may alter hydrogen-bonding properties and introduce additional 

179 effects that may be important factors in dictating good enzymatic inhibition. The X-ray structure of 

180 HKU4-CoV 3CL pr0 in complex with 2A, discussed below, shows that the amide -NH group of the 

181 carboxamide group of 2A forms an important interaction with the backbone carbonyl of His41. 

182 Of the 205 Class B inhibitors that were tested for inhibition of HKU4-CoV 3CL pro , 24 were found 

183 to inhibit HKU4-CoV 3CL pro above 50% at a concentration of 100 pM, and subsequently, IC 50 values 

184 were determined (Table 1b). Interestingly, we found fewer good inhibitors and no sub-micromolar 

185 inhibitors of HKU4-CoV 3CL pr0 among this larger, more substituted dipeptide-like class of compounds 

186 (~76% of compounds in Class A (19 of 25 compounds tested) were good inhibitors of HKU4-CoV 

187 3CL pr0 while only ~12% of compounds in Class B (24 of 205 compounds tested) were found to be 

188 good inhibitors). This result could be a consequence of the increased steric bulk perturbing the 

189 preferred binding orientation of the Class B compounds in the HKU4-CoV 3CL pr0 binding site. A 

190 comparison of similar Class A and Class B inhibitors more clearly elucidates this trend, where the 

191 addition of the sterically bulky R 4 f-Bu amide in the Class B compounds decreases HKU4-CoV 3CL prp 

192 inhibition in each instance (S3). For example, the most active Class A inhibitor, 1A, having and IC 50 of 

193 0.33 ± 0.02 pM and 93% inhibition at 100 pM, is almost completely inactivated by the addition of the t- 

194 Bu amide as seen in the Class B compound 25B, with only 26% inhibition at 100 pM. Within the 

195 Class B inhibitors, there is a preference for the 3-thiophene at the R 1 position and a 1- 

196 methylbenzotriazole at the R 3 position of the peptidomimetic backbone, though it is not as strong as 

197 that observed in within Class A (25% of compounds in Class B vs. 95% of compounds in Class A). 

198 Among the series where R 1 = 3-thiophene and R 3 = 1-methylbenzotriazole (compounds 3B, 6B, 9B, 

199 16B, 18B, and 21B) small carboxamide groups at the R 2 position are favored as observed by the 

200 comparison of 3B to 28B and 6B to 27B (see Tables 2 and 3). Possibly, Class B inhibitors with small 

201 R 2 amides allow for a shift in the position of the inhibitor in the HKU4-CoV 3CL pr0 binding site to 

202 accommodate the bulky t-Bu amide at the R 4 position. Again, an R 2 amide is preferred to an amine 

203 for good inhibition of HKU4-CoV 3CL prp (IC 50 = 1.8 ± 0.5 pM for 3B vs. 22.0 ± 7.4 pM for 21B). 

204 Replacement of the benzotriazole within this series with a benzimidazole has dissimilar effects; for 

205 example, comparing 19B (Table 1b) to 28B (S2 and S3) where the benzimidazole of 19B has been 

206 replaced with a benzotriazole, inhibition decreases from 68% to 34% at 100 pM, respectively. In 
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207 contrast, comparison of 23B to 18B shows that substitution of benzimidazole for benzotriazole 

208 increases inhibition from 56% to 76% at 100 pM (Table 1b). Interestingly, only Class B compounds 

209 having a para-substituted phenyl ring with an R 2 substituent were found to be good inhibitors of 

210 HKU4-CoV 3CL pr °; compounds with other, non-phenyl rings including cyclopropyl, cyclohexyl, 

211 adamantane, and bicycloheptane were not found to be active inhibitors. 

212 

213 2.2.1A-RFL Sub-structural activity analysis via component synthesis 

214 In an attempt to delineate the structural relationships that contribute to the enhanced activity of 

215 our most active inhibitor, 1A, we accomplished the synthesis of the sub-structural components of 1A 

216 and determined their individual inhibitory activities against HKU4-CoV 3CL pro (Scheme 1). The 

217 straightforward synthesis of these sub-structural 1A components, 22A, 23A, and 24A, was 

218 accomplished from the commercially available starting materials, benzotriazole and p- 

219 phenylenediamine. The synthesis of 22A was achieved in two steps. First, alkylation of 

220 benzotriazole by reaction with chloroacetic acid and sodium hydroxide yielded the benzotriazoleacetic 

221 acid 22 in 50% yield. 25 Subsequent coupling of 22 with the commercially available 

222 thiophenylmethanamine using EDC produced the desired 1A component 22A in 25% yield (Scheme 

223 1, top). The synthesis of 23A and 24A was accomplished linearly by first coupling p- 

224 phenylenediamine with the commercially available thiophenecarboxylic acid using HATU, which 

225 resulted in the desired 1A component 23A in 58% yield. Reductive amination of 23A and 

226 commercially available thiophenecarbaldehyde using sodium triacetoxyborohydride produced the 

227 desired 1A sub-structural component 24A in 81% yield. Finally, 1A was resynthesized according to 

228 the literature procedure by coupling 24A with 22 using EDC as a coupling reagent. 24 

229 The sub-structural components 22A, 23A, and 24A were then individually tested for inhibition 

230 of FIKU4-CoV 3CL prp at a concentration of 100 pM. The 22A component, representing the R 1 and R 3 

231 substituents of the Class A peptidomimetic backbone, showed 28% inhibition of FIKU4-CoV 3CL pro at 

232 100 pM and had an IC 50 of 68 pM, supporting our hypothesis that this backbone has some inhibitory 

233 capacity even in the absence of the R 2 component. The 23A sub-structural component, representing 

234 the R 2 substituent of the Class A peptidomimetic backbone, was found to have no inhibitory activity 

235 against FIKU4-CoV 3CL pro at 100 pM, supporting R, and R 3 as essential substituents. Additionally, the 

236 24A sub-structural component, representing the R^nd R 2 substituents of the Class A peptidomimetic 

237 backbone, showed no inhibition of HKU4-CoV 3CL pro at 100 pM, indicating the crucial nature of the 1- 

238 methylbenzotriazole at the R 3 position of the Class A backbone and supporting our previous 

239 observation of the need for a precisely oriented hydrogen bond acceptor at this position within the 
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240 Class A peptidomimetics ( vide supra). This finding establishes 22A as a minimum component 

241 necessary for inhibition of HKU4-CoV 3CL pro and suggests it may be used as a scaffold for future 

242 fragment based design of HKU4-CoV 3CL pr0 inhibitors. Interestingly, the work of Wong and coworkers 

243 showed that a series of benzotriazole esters, which act as potent, covalent inhibitors of SARS-3CL prp , 

244 are also able to inhibit SARS-3CL pr0 upon replacement of the ester oxygen with carbon, albeit 

245 weakly. 26 Taken together, these findings may indicate an inhibitory preference for benzotriazoles 

246 among coronaviral 3C-like proteases. 

247 

248 2.3. X-ray crystallographic analysis of HKU4-CoV 3CL pro in complex with 1 A, 2A, and 3B. 

249 In order to more fully elucidate the structural factors underlying the structure-activity 

250 relationships (SARs) observed in the kinetic data, we determined the X-ray crystal structures of 

251 HKU4-CoV 3CL prp in complex with three of the most potent inhibitors, 1A, 2A, 3B and the favored 

252 backbone 22A. These complexes were chosen to determine the potential structural explanations 

253 behind our observations that Class A and B inhibitor backbones, containing a 3-thiophene and 1- 

254 methylbenzotriazole, are favored for HKU4-CoV 3CL pro inhibition. We also sought to determine the 

255 structural reasons behind the preference for inhibition by Class A compounds over Class B 

256 compounds. 

257 Co-crystallization of 1A, 2A, 3B, and 22A with HKU4-CoV 3CL pro produced high-quality crystals 

258 suitable for X-ray data collection and structure determination. Complete X-ray data sets were 

259 collected on these four HKU4-CoV 3CL pro -inhibitor complexes to resolutions between 1.8 A and 2.3 A. 

260 A summary of the data collection and refinement statistics are provided in S4. 

261 The X-ray crystal structures confirm that both the Class A (1A and 2A) and B (3B) 

262 peptidomimetics bind in the active site pocket of HKU4-Co 3CL pr0 , which is lined primarily with polar 

263 residues and is solvent exposed (Figure 2). In each of the HKU4-CoV 3CL pr °-inhibitor complexes, the 

264 carbonyl carbon of the central inhibitor backbone, i.e preceding the R 3 group, is located between 4.5- 

265 4.8 A from the sulfur atom of the catalytic cysteine (Cys148, Figure 2, A, C, E). Both 1A and 2A, 

266 which have a 3-thiophene at the R 1 position and a 1-methylbenzotriazole at the R 3 position of the 

267 peptidomimetic backbone, position these moieties in identical chemical space (Figure 2, A & C). The 

268 Class B inhibitor, 3B, which has a 3-thiophene at the R 1 position and a 1-methylbenzotriazole at the 

269 R 3 position, also positions the 3-thiophene and 1-methylbenzotriazole functionalities in the same 

270 chemical space utilized by the Class A inhibitors (Figure 2, E). The 3-thiophene in the R 1 position of 

271 all three inhibitors occupies the S 2 -S 4 sub-sites of the FIKU4-CoV 3CL pr0 active site, the anilido and R 2 - 

272 functionality reside in the S^S/ sub-sites, and the R 3 benzotriazole occupies the S r subsite, acting as 

273 the P, group of the substrate. 
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274 The X-ray structures reveal the structural determinants for the inhibitory preference for Class 

275 A and B compounds with backbones containing a 3-thiophene and 1-methylbenzotriazole. This 

276 backbone allows the inhibitor to anchor into the HKU4-CoV 3CL pr0 active site via the formation of three 

277 key hydrogen bonds. First, there is a direct hydrogen bond that exists between the 3-nitrogen atom 

278 of benzotriazole and the te/e-nitrogen of Hisl 66 in each enzyme-inhibitor complex, securing the 

279 location and position of the benzotriazole in the active site. A second hydrogen bond between the 

280 carbonyl oxygen of the inhibitor backbone and the backbone nitrogen of Glu169 in each structure 

281 further anchors the inhibitor in the HKU4-CoV 3CL pro active site. Finally, the thiophene ring is 

282 positioned within the HKU4-CoV 3CL pro active site allowing the formation of a relatively uncommon 

283 hydrogen bond to the hydroxyl hydrogen of Tyr54. 27 ' 30 

284 In each of the FIKU4-CoV 3CL pr °-inhibitor complexes, the R 2 -substituted aryl ring of both Class 

285 A and B inhibitors is positioned in the FIKU4-CoV 3CL pro active site to take advantage of an edge-to- 

286 face pi-interaction with the catalytic histidine, His41 (Figure 2, A, C, E). This interaction is may be 

287 important for good inhibition of HKU4-CoV 3CL prp for two reasons. First, comparing 1A and 2A to 

288 22A, where the R 2 -substituted aromatic ring of 1A and 2A is absent, a greater than 200-fold increase 

289 in IC 50 is observed for 22A. Second, Class B compounds having non-aromatic substituents instead of 

290 an R 2 -substituted aryl ring were not found to be good inhibitors of FIKU4-CoV 3CL pro ( vide supra). 

291 These data indicate that for low micromolar to sub-micromolar inhibition of HKU4-CoV 3CL prp , an R 2 - 

292 substituted aryl ring may be required. Furthermore, there is a conserved active site water molecule 

293 present in the X-ray crystal structure of each FIKU4-CoV 3CL pr0 -inhibitor complex that mediates a 

294 hydrogen-bond between the backbone carbonyl oxygen of His41 and the amide R 2 nitrogen of the 

295 inhibitor (Figure 2, A, C, E). This finding illuminates the observed preference for an amide substituent 

296 at the R 2 position in both Class A and B inhibitors. 

297 Our attempt to solve the X-ray crystal structure of 22A in complex with FIKU4-CoV 3CL pro did 

298 not result in the structure of the HKU4-CoV 3CL pro -inhibitor complex, but instead resulted in the X-ray 

299 structure of the uniiganded, or free, FIKU4-CoV 3CL pro enzyme, which was found to crystallize in a 

300 different space group than the HKU4-CoV 3CL pr °-inhibitor complexes (Figure 3, A). The unbound, or 

301 free, HKU4-CoV 3CL pro enzyme was found to have noteworthy changes in the active site architecture 

302 in comparison to the X-ray crystal structures of the HKU4-CoV 3CL pr °-inhibitor complexes. In the 

303 absence of an inhibitor, what appears to be a metal ion binds to the FIKU4-CoV 3CL pro active site by 

304 coordinating to the catalytic cysteine (Cys148), which is rotated 180° from the inhibitor bound 

305 structures, resulting in a 3.2 A positional change in the location of the sulfur atom with a distance of 

306 2.2 A between the sulfur of Cys148 and the metal atom. The metal is presumed to be a zinc cation, 

307 as it is present in water and likely to bind to a sulfur atom (also seen in PDB ID 2YNA), though it could 
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308 also be iron. The te/e-nitrogen of His41 is shifted 2.6 A from its position in the HKU4-CoV 3CL prp - 

309 1A-RFL complex X-ray crystal structure, allowing for the coordination of the Zn 2+ atom, which is 

310 located 2.4 A away (Figure 3, A). Additionally, the positions of the Met25, Cys44, Ala46, Tyr54, and 

311 Gln192 residues all noticeably shift upon superposition of the free and inhibitor-bound forms of HKU4- 

312 CoV 3CL pro . 

313 Comparing the most potent Class B inhibitor having the 1-methylbenzotriazole and 3- 

314 thiophene backbone, 3B, to the top HKU4-CoV 3CL pr0 inhibitor identified, 1A, elucidates the observed 

315 preference for Class A inhibitors over Class B inhibitors (Figure 3, C). The (F7)-enantiomer of the 

316 Class B compound, 3B, was found in the FIKU4-CoV 3CL prp active site, with the backbone of the 

317 inhibitor oriented similarly to the backbones of the Class A inhibitors 1A and 2A, utilizing the same 

318 hydrogen bonds; however, the (Fi)-R 4 t- Bu amide substituent of 3B, which is absent in the Class A 

319 inhibitors, marginally rotates the thiophene away from Tyr54, increasing the distance to the hydrogen- 

320 bond donor Tyr54 from 3.9 to 4.5 A compared to 1A, and moving the thiophene out of hydrogen- 

321 bonding distance to Tyr54 (distance measured to heteroatoms). The R 4 t- Bu amide of 3B extends out 

322 toward the solvent exposed side of the active site, inducing a structural rearrangement of HKU4-CoV 

323 3CL pr0 where G!n192, which encloses 1A and 2A in the active site, is moved 2.7 A to accommodate 

324 the steric bulk of the R 4 substituent of 3B. The combination of these factors may be attributed for the 

325 observed 5-fold increase in IC 50 when comparing 1A and 2A to 3B and likely account for the reduced 

326 inhibitory activity of Class B compounds relative to Class A compounds. 

327 The Class B peptidomimetic compounds were synthesized and tested for HKU4-CoV 3CL pr0 

328 inhibition as racemates, which may lead to significant differences in IC 50 values. An example of this 

329 can be seen by the comparison of (f?)-13B to racemic- and (S)-13B (Figure 4). The enantiopure (, R)- 

330 13B has an IC 50 of 9.3 ± 0.5 pM while the 13B racemate has an IC 50 of 11.1 ± 0.6 pM and (S)-13B has 

331 a 50% reduction in inhibition relative to the racemate, with only 44% inhibition of FIKU4-CL pr0 at 100 

332 pM.. Interestingly, (R)- 3B was found in the active site of HKU4-CoV 3CL pr0 in the X-ray crystal 

333 structure (Figure 2, E and F), suggesting an (F?)-stereochemical of the inhibitor for good inhibition of 

334 FIKU4-CoV 3CL pro , despite the nature of the R 1 and R 3 heterocyclic groups. 

335 The X-ray structures of HKU4-CoV 3CL prp in complex with inhibitors provide vital insights into 

336 the types of compounds that can be developed as therapeutics against lineage C Betacoronaviruses. 

337 A sequence alignment of FIKU4-CoV, HKU5-CoV, and MERS-CoV 3CL pr0 , all of which are lineage C 

338 Betacoronaviruses, show that FIKU4-CoV and HKU5-CoV are 81.0% and 82.7% identical to MERS- 

339 CoV 3CL pr0 , and HKU4-CoV is 83.7% identical to HKU5-CoV 3CL pro (Figure 5). 31 ' 33 Moreover, the 

340 pertinent HKU4-CoV 3CL pro active site residues (Met25, His41, Cys44, Ala46, Tyr54, Cys148, Hisl 66, 
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341 Giu169, and Gln192, Figures 3 and 4) identified in the X-ray crystal structures that line the active 

342 site and interact with the peptidomimetic inhibitors are all conserved in HKU5-CoV and MERS-CoV 

343 3CL pr0 . These data indicate that the active site architectures among HKU4-CoV, HKU5-CoV, and 

344 MERS-CoV 3CL pr0 may be similar enough to design broad-spectrum antiviral therapeutics against 

345 lineage C Betacoronaviruses and have implications for the structure and inhibition of lineage C 

346 Betacoronaviruses yet to emerge. To test this hypothesis, we tested whether our lead HKU4-CoV 

347 3CL pr0 inhibitor, compound 1A, would also have inhibition of HKU5-CoV 3CL pro . We found that 

348 compound 1A is a good inhibitor of HKU5-CoV 3CL pro , having 91% inhibition at 100 pM and a sub- 

349 micromolar IC 50 value of 0.54 ± 0.03 pM which is comparable to the IC 50 value of 0.33 ± 0.02 pM 

350 against HKU4-3CL pr0 . Based on these data, we predict compound 1A will show inhibition of MERS- 

351 CoV 3CL pro as well. 

352 

353 3. Conclusions 

354 In conclusion, 43 of the 230 peptidomimetics were found to inhibit HKU4-CoV 3CL pro , 19 of 

355 which belonged to the Class A peptidomimetic series and 24 of which belonged to the Class B 

356 peptidomimetic series. Overall, the Class A peptidomimetics were found to be better inhibitors 

357 relative to the Class B compounds, and two of the Class A compounds were determined to have sub- 

358 micromolar inhibitory potency against HKU4-CoV 3CL pr0 , 1A and 2A. Analysis of the inhibition data 

359 from the 230 peptidomimetics resulted in the identification of a favored inhibitor backbone consisting 

360 of 3-thiophene and 1-methylbenzotriazole functionalities. To evaluate the relative contribution of the 

361 favored inhibitor backbone to the total HKU4-CoV 3CL pr0 inhibition observed for the molecule, we 

362 synthesized the sub-structural components of the lead inhibitor, 1A, and tested them for inhibition of 

363 HKU4-CoV 3CL pro . The simple compound, 22A, which represents the R, and R 3 inhibitor 

364 functionalities and consists of the 3-thiophene and 1-methylbenzotriazole of the favored backbone, 

365 was found to inhibit HKU4-CoV 3CL pra even in the absence of the anilido and R 2 components of the 

366 inhibitor. We then determined the X-ray crystal structures of the three most potent inhibitors 

367 containing this favored backbone, 1A, 2A, 3B, in complex with HKU4-CoV 3CL prp . These X-ray 

368 crystal structures prove that the three Class A and B inhibitors bind in the HKU4-CoV 3CL pro active 

369 site, utilizing the same set of three hydrogen-bonding interactions between the favored backbone 

370 backbone and the HKU4-CoV 3CL pr0 active site residues, serving to anchor compounds with this 

371 backbone into the active site of the enzyme. Our attempt to solve the X-ray structure of the privileged 

372 backbone compound, 22A, in complex with HKU4-CoV 3CL prp resulted in the structure of unbound 

373 HKU4-CoV 3CL pr0 , which is presumably as a consequence of weak inhibition by 22A (28% at 100 
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374 pM). Superposition of the unbound and inhibitor-bound forms of HKU4-CoV 3CL pr0 revealed 

375 several subtle, but important changes in the active site architecture upon inhibitor binding. 

376 Furthermore, sequence alignment of HKU4-CoV, HKU5-CoV, and MERS-CoV 3CL prp show that the 

377 residues involved in inhibitor binding are conserved, indicating that identification of broad spectrum 

378 3CL pr0 inhibitors of lineage C Betacoronaviruses is possible. In support of this, we found our most 

379 active HKU4-CoV 3CL pr0 inhibitor to have inhibition of HKU5-CoV 3CL pr0 in the sub-micromolar range 

380 (IC 50 = 0.54 ± 0.03 pM). These findings, and the structural details underlying the potent inhibition of 

381 HKU4-CoV 3CL pro by compounds containing 3-thiophene and 1-methylbenzotriazole backbones, can 

382 be used in the future to design compounds with the ability to more fully exploit the HKU4-CoV 3CL pr0 

383 active site, leading ultimately to the development of more highly potent and selective inhibitors 

384 against human pathogens such as SARS and MERS. 

385 

386 4. Experimental 

387 4.1. Expression and purification of HKU4-CoV 3CL pr0 

388 The gene encoding the 3CL pro of HKU4-CoV (residues 3292-3597 in the Tylonycteris bat-CoV 

389 polyprotein) 9 was codon optimized for expression in E. coli and cloned into pET-1 la expression 

390 vector with an N-terminal (His) 6 -tag followed by nsp4-/5 auto-cleavage site by BioBasic Inc. This 

391 construct results in the expression of HKU4-CoV 3CL prp without an N-terminal or C-terminal extension. 

392 E. coli BL21(DE3) cells, transformed with pET11a-HKU4 3CL pro plasmid were grown in Super LB 

393 media (3 g potassium phosphate monobasic, 6 g sodium phosphate dibasic, 20 g tryptone, 5 g yeast 

394 extracts, 5 g sodium in 1 L water, pH 7.2 adjusted with 1 M NaOH) in the presence of 1 mL 100 

395 mg/mL carbenicillin, 25 mL 8% lactose, 10 mL 60% glycerol, and 5 mL of 10% glucose per 1 L of 

396 expression culture for 24 hours at 25 °C. The cells were harvested by centrifugation (8,400 gfor 20 

397 min) to yield 15.5 g/L of cells. The cell pellet was then re-suspended in 5 mL of Buffer A/1 g cell 

398 pellet (50 mM Tris pH 7.5, 0.2 M ammonium sulfate, 0.05 mM EDTA, 5 mM BME) containing 1 mg/mL 

399 lysozyme. After the cells were homogenized, they were lysed via sonication for 10 minutes with 10 s 

100 pulses at 50% amplitude using a Branson Digital Sonifier. The cell lysate was clarified by pelleting 

101 the cell debris via centrifugation (28,960 g, 4 °C, 20 minutes) and loaded onto a 30 mL Phenyl 

102 Sepharose 6 Fast Flow HiSub column equilibrated with Buffer A. Protein was eluted with a gradient 

103 to 100% Buffer B (20 mM Tris pH 7.5, 0.05 mM EDTA, 5 mM BME) over five column volumes (150 

104 mL) collecting 5 mL fractions. Fractions containing HKU4 3CL prp were pooled and loaded onto a 60 

105 mL DEAE Sepharose Fast Flow column equilibrated with Buffer B. Protein was eluted with a gradient 

106 to 50% Buffer C (50 mM Tris pH 7.5, 1 M sodium chloride, 0.05 mM EDTA, 5 mM BME, 10% glycerol) 

107 over five column volumes (300 mL) collecting 5 mL fractions. Fractions containing pure HKU4-CoV 
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3CL pr0 were pooled, dialized into storage buffer (25 mM HEPES, pH 7.5, 2.5 mM DTT, 10% 
glycerol), and concentrated. The protein was then aliquoted into 1.5 ml_ push-cap Eppendorf tubes, 
flash frozen in liquid nitrogen, and stored at -80 °C in a freezer until further use. 

4.2. IC 50 Determination of inhibitors against HKU4-CoV 3CL pro at 25 °C and 37 °C: 

Each of the 230 dipeptide-like inhibitors were first screened for inhibition of HKU4 3CL pr0 at a 
concentration of 100 pM in duplicate assays containing the following assay buffer (50 mM HEPES, 
0.1 mg/mL BSA, 0.01% TritonX-100, 1 mM DTT). The assays were carried out in Costar 3694 
EIA/RIA 96-Well Half Area, Flat Bottom, Black Polystyrene plates from Corning Incorporated. 1 pL of 
100X inhibitor stock in DMSO was added to 79 pL of enzyme in assay buffer and the enzyme-inhibitor 
mixture was incubated for 10 minutes. The reaction was initiated by the addition of 20 pL of 10 pM 
UIVT3 substrate, a custom synthesized Forster resonance energy transfer substrate peptide with the 
following sequence: HilyteFluor™488-ESARLQSGLRKAK-QXL520™-NH 2 , producing final 
concentrations of 100 nM and 100 pM for the 3CL pr0 enzyme and UIVT3 substrate, respectively. The 
fluorescence intensity of the reaction was then measured over time as relative fluorescence units 
(RFU t ) for a period of 10 minutes, using an excitation wavelength of 485 and bandwidth of 20 nm and 
monitoring emission at 528 and bandwidth of 20 nm using a BioTek Synergy HI multimode 
microplate reader. 

The inhibition of HKU4-CoV 3CL pro by inhibitor compounds was monitored by following the 
change in RFUs over time, using the initial slope of the progress curve to determine the initial rate 
(V). The percent inhibition of the 3CL pro enzymes was determined using the following equation: 


% Inhibition — 


Inhibited 3CLpro RFU/s — Background RFU/s 
Uninhibited 3CLpro RFU/s — Background RFU/s. 


* 100 


Full IC 50 data were acquired for the compounds that showed greater than 50% inhibition of 
HKU4-CoV 3CL pr0 at 100 pM of inhibitor compound. The IC 50 values were determined at ambient 
temperature from 100 pL assays performed in triplicate in the following buffer: 50 mM HEPES, 0.1 
mg/mL BSA, 0.01% TritonX-100, 1 mM DTT. Kinetic assays were carried out in Costar 3694 EIA/RIA 
96-Well Half Area, Flat Bottom, Black Polystyrene plates from Corning Incorporated. Each inhibitor 
was tested at concentrations of 0.313, 0.652, 1.25, 2.5, 5.0, 10.0, 20.0, 40.0, 60.0, 80.0, 100.0, and 
120.0 pM; 1 pL of 100X inhibitor stock in DMSO was added to 79 pL of enzyme in assay buffer and 
the enzyme-inhibitor mixture was incubated for 10 minutes. The reaction was initiated by the addition 
of 20 pL of 10 pM UIVT3 substrate, producing final concentrations of 100 nM and 2 pM for the 3CL prp 
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enzyme and UIVT3 substrate, respectively. The fluorescence intensity of the reaction was then 
measured over time as RFU t for a period of 20 minutes, using an excitation wavelength of 485 and 
bandwidth of 20 nm and monitoring emission at 528 and bandwidth of 20 nm using a BioTek Synergy 
HI multimode microplate reader. 

The percent inhibition of the 3CL pro enzymes was then plotted as a function of inhibitor 
concentration. The SigmaPlot Enzyme Kinetics Wizard was used to fit the triplicate percent inhibition 
data and associated standard error to a non-linear Michaelis-Menten type regression model and 
determine the IC 50 for each enzyme using the following equation: 


% Inhibition — 


%/r, 


[Inhibitor] 


IC 5o + [Inhibitor] 


ui 


o 


ressic 


where %\ max \s the percent maximum inhibition of 3CL pr0 and the error in IC 50 values was determined as 
the error in the fitted parameter. 


4.3. Crystallization and X-ray structure determination of HKU4 3C-like protease in complex 
with inhibitors: 

The HKU4 3CL pr °-inhibitor complexes were co-crystallized from four different crystallization 
solutions. Briefly, the hanging-drop, vapor-diffusion method was used for crystallization by setting up 
drops and adding 1 pL of purified HKU4-CoV 3CL pr0 (2.5 mg/mL, 75 pM), that had been incubated for 
three hours with a 3 molar excess of the appropriate inhibitor, and 1 pL of reservoir solution. For 1A: 
15% PEG-3350, 2% tacsimate, 5% isopropanol, 0.1 M imidazole pH 6; for 3B: 20% PEG-3350, 0.2 M 
ammonium acetate, 0.1 M Bis-Tris pH 5.5; for 2A: 18% PEG-3350, 5% tacsimate, 0.2 M ammonium 
sulfate, 0.1 M Bis-Tris pH5.5; for 22A: 10% PEG-3350, 2% tacsimate, 5% isopropanol, 0.1 M 
imidazole pH 6.5. Protein crystals appeared between 14 hours and 7 days after setting up 
crystallization plates. Crystals were harvested with a nylon loop, which was then swiped through the 
same mother-liquor solution supplemented with 15% MPD. The crystals were cooled by plunging into 
liquid nitrogen and stored in shipping dewars containing liquid nitrogen until X-ray data collection at 
an available synchrotron could be performed. 

All diffraction data were collected at 100 K at the Life Sciences Collaborative Access Team 
(LS-CAT) at the Advanced Photon Source (APS) at Argonne National Laboratories. Crystals were 
transferred from shipping dewars into automated dewars and then and mounted robotically on a 
goniostat while under a stream of N 2 . X-ray data sets of HKU4 3CL pr °-inhibitor complexes were 
collected on a Rayonix 225 HE detector at a wavelength of 0.98 A. X-ray data were processed and 
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172 scaled using the program HLK2000, crystals of HKU4-CoV 3CL pr ° complexes with the inhibitors 1A, 

173 2A, and 3B belonged to the space group P2 1 2 1 2 1 and HKU4-CoV 3CL pr0 crystallized as a dimer in the 

174 asymmetric unit. Co-crystallization of HKU4-CoV 3CL pro with 22A did not result in an HKU4-CoV 

175 3CL pro :inhibitor co-complex, but instead yielded the apo form of HKU4-CoV 3CL pro , which crystallized 

176 in a different space group, PS^I with one dimer per asymmetric unit. 

177 The initial phases for HKU4-CoV 3CL prp in complex with 1A, 2A, and 3B were determined by 

178 molecular replacement using Phaser-MR (simple interface) in the Phenix suite and the protein 

179 database PDB file 2YNA. 34 The inhibitors were built and optimized from their SMILES strings using 

180 eLBOW and fit into the density using LigandFit in the Phenix suite and the program COOT. 

181 Structures were refined using Phenix. Water molecules were added manually to the 2F o -F c density 

182 peaks that were greater than 3.0 and iterative rounds of refinement were carried out until R wark an6 

183 R free reached their lowest values. The electron density maps presented in the figures were calculated 

184 using Phenix and COOT and the figures generated using the program PyMol. 

185 

186 4.4. General chemistry methods 

187 All reagents and chemicals were purchased from SigmaAldrich or Acros Organics and used 

188 without further purification. Anhydrous solvents were purchased or purified by passage through a 

189 solvent column composed of activated alumina and a supported copper redox catalyst. Analytical 

190 thin-layer chromatography was performed using Sorbent Technologies Glass-Backed Silica Gel HL 

191 TLC plates W/UV254. Flash chromatography was performed with Sorbent Technologies 200-400 

192 mesh silica gel. Nuclear magnetic resonance (NMR) data was acquired using either a Varian 

193 Inova300, Bruker ARX400, or Bruker AV-lll-800 spectrometers at 300, 400, or 800 MFIz for '\-\ or 75, 

194 100, or 200 MHz for 13 C. All compounds were purified by flash chromatography to >90% purity by 'H 

195 NMR. 

196 

197 2-(1 W-benzo[cd[1,2,3]triazol-1-yl)-AF(thiophen-3-ylmethyl)acetamide (22A). 

198 To 2-(1 /-/-benzo[c/j[1,2,3]triazol-1 -yl)acetic acid (1.5 eq.) in anhydrous dichloromethane (0.2 M) 

199 at 0 °C under nitrogen atmosphere was added EDC (3.0 eq.) and HOBT (3.0 eq.). The mixture was 

500 allowed to stir for five minutes before the addition of DIEA (13.0 eq.). The mixture was allowed to stir 

501 for an additional 20 minutes before the addition of thiophen-3-ylmethanamine (1.0 eq.). The reaction 

502 mixture was allowed to warm to ambient temperature and stirred under nitrogen atmosphere for 16 

503 hours. The reaction was quenched by addition of saturated potassium bicarbonate and the aqueous 

504 layer was extracted with ethyl acetate (3X). The combined organic layers were washed with water 

505 (IX) and brine (IX), dried over sodium sulfate, and concentrated. The crude residue was 
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recrystaliized from hot ethanol to provide the product as a beautiful white, crystalline solid in 30% 
yield. 'H NMR (800 MHz, CD 3 OD) (ppm): 8.03 (1H, d, J = 8.37), 7.72 (1H, d, J = 8.37), 7.59 (1H, dd, 
J = 8.37, 0.96), 7.48 (1H, dd, J = 8.37, 0.96), 7.40 (1H, dd, J = 4.94, 2.97), 7.29 (1H, m), 7.08 (1H, dd, 
J = 4.93, 1.30), 5.51 (2H, s), 4.46 (2H, s). 13 C NMR (200 MHz, CD 3 OD) (ppm): 166.47, 145.30, 
138.67, 133.83, 127.63, 126.92, 125.81, 124.28, 121.90, 118.60, 110.11, 49.87, 38.19. ESI-MS(+): 
273 [M + 1], 295 [M + 23], 


J 


A/-(4-aminophenyl)thiophene-2-carboxamide (23A). 

Thiophene-2-carboxylic acid (1.0 eq.) and HATU (1.4 eq.) were added to anhydrous 
dichloromethane (0.2 M) at room temperature followed by drop-wise addition of DIEA (3.0 eq.) under 
nitrogen atmosphere. The mixture was allowed to stir at ambient temperature for 5 minutes before 
benzene-1,4-diamine was added in one portion. The reaction mixture was allowed to stir for 14 hours 
before it was quenched with saturated potassium bicarbonate. The aqueous layer was extracted with 
ethyl acetate (3X) and the combined organic extracts were washed with water (IX) and brine (IX), 
dried over sodium sulfate, and concentrated. The crude yellow residue was purified by flash 
chromatography using a gradient of 20% to 70% ethyl acetate in hexanes to give the final product in 
50% yield. This reaction was performed from mg to g scale with no detrimental effect on yield. 'H 
NMR (800 MHz, CD 3 OD) (ppm): 7.84 (1H, d, J = 3.74), 7.67 (1H, dd, J = 4.97, 1.08), 7.37 (2H, d, J = 
8.56), 7.15 (1H, dd, J = 4.97, 3.74)/^ 3 C NMR (200 MHz, CD 3 OD) (ppm): 161.26, 144.66, 139.63, 
130.56, 128.71, 128.31, 127.49, 122.85. ESI-MS(+): 219 [M + 1], 241 [M + 23], 


A/-(4-((thiophen-3-ylmethyl)amino)phenyl)thiophene-2-carboxamide (24A). 

23A (1.0 eq.) and thiophene-3-carbaldehyde (1.0 eq.) were dissolved in anhydrous 1,2-DCE 
(0.2 M) at ambient temperature under nitrogen atmosphere. Sodium triacetoxyborohydride (1.6 eq.) 
and acetic acid (1.0 eq.) were then added in one portion. The reaction mixture was allowed to stir for 
30 minutes before the reaction was quenched with 1 N sodium hydroxide. The aqueous layer was 
extracted with ethyl acetate (3X) and the combined organic extracts were washed with water (IX) and 
brine (IX), dried over sodium sulfate, and concentrated. The crude yellow residue was purified by 
flash chromatography using a gradient of 20% to 70% ethyl acetate in hexanes to give the final 
product in 50-80% yield. 'H NMR (800 MHz, CD 3 OD) (ppm): 7.85 (1H, m), 7.69 (1H, m), 7.36 (3H, 
m), 7.26 (1H, m), 7.17 (1H, d, J =5.15), 7.12 (1H, d, J = 5.15), 6.70 (2H, m), 4.34-4.35 (2H, rotomeric 
peaks). 13 C NMR (200 MHz, CD 3 OD) (ppm): 172.76, 146.24, 141.08, 130.47, 128.20, 127.65, 
127.43, 126.82, 125.27, 122.87, 120.85, 112.81,43.08). ESI-MS(+): 315 [M + 1], 337 [M + 23], 
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7. Figure Legends 

Figure 1. Genome and proteome organization of HKU4-, HKU5-, and MERS-CoV non- 
structural proteins highlighting the PLpro and 3CL-pro cleavage sites. 

Figure 2. X-ray crystal structures of 1A, 2A and 3B in complex with HKU4-CoV 3CL pr0 . 
Inhibitors are colored according to atom type and shown in ball and stick representation. Wall-eye 
stereo-view is shown in panels A, C, and E. Water molecules are shown as red spheres. Hydrogen 
bonds are shown as gray dashes with distances between heteroatoms labeled (A). Interatomic 
distances (A) are displayed using a gray arrow. Electron density omit maps (F- F c ) are shown in 
grey mesh and are contoured to +3.0 around the inhibitor only. The binding orientation of each 
inhibitor is the same in the active site of each monomer of the dimer in the asymmetric unit of HKU4- 
CoV 3CL pro , therefore only one active site is shown for clarity. (A) HKU4-CoV 3CL pro (pale yellow) in 
complex with 1A (sky blue), PDB 4YOI,^'(B) 1A electron density omit map, (C) HKU4-CoV 3CL pr0 
(wheat) in complex with 2A (pale cyan), PDB 4YOJ, (D) 2A electron density omit map, (E) HKU4-CoV 
3CL pro (light orange) in complex with 3B (slate), PDB 4YOG, (F) 3B electron density omit map 
contoured at +3.0 (blue) and +2.0 (grey). 

Figure 3. (A) Superposition of unbound HKU4-CoV 3CL pro (gray, PDB 4Y09) and HKU4-CoV 
3CL pro co-crystallized with inhibitor 1A (where HKU4-CoV 3CL pr0 is shown in ribbon representation in 
light yellow and 1A is shown in sky blue and colored by element, PDB 4YOI). The Zn atom in the 
unbound HKU4-CoV 3CL pro is represented as a gray sphere. (B) Superposition of HKU4-CoV 3CL pr0 
co-crystallized with 1A (where HKU4-CoV 3CL pr0 is shown in ribbon representation in light yellow and 
1A is shown in sky blue and colored by element, PDB 4YOI) and HKU4-CoV 3CL pr0 co-crystallized 
with 2A (where HKU4-CoV 3CL pro is shown as cartoon in wheat and 2A is shown in pale cyan and 
colored by element, PDB 4YOJ). (C) Superposition of HKU4-CoV 3CL pr0 co-crystallized with 1A 
(where HKU4-CoV 3CL pro is shown in ribbon representation in light yellow and 1A is shown in sky blue 
and colored by element, PDB 4YOI) and HKU4-CoV 3CL pr0 co-crystallized with 3B (where HKU4-CoV 
3CL pro is shown as cartoon in light orange and 3B is shown in slate and colored by element, PDB 
4YOG). Wall-eye stereo-view is shown in each panel. 

Figure 4. Effect of 13B stereocenter configuration on HKU4-CoV 3CL pr0 inhibition. Where % 
inhibition is determined at 100 pM inhibition concentration, n/t = not tested. 

Figure 5. Sequence alignment of HKU4-CoV, HKU5-CoV, and MERS-CoV 3CL pr0 where the 
top row indicates the secondary structure of HKU4-3CLpro at a particular residue position, conserved 
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residues among ail three 3CL pro s are shaded in red, and residues that have similar steric or 
electronic properties are shown in red text with no shading. Non-conserved residues are displayed in 
black text. The pertinent active site residues that border and interact with the peptidomimetic 
inhibitors (Met25, His41, Cys44, Ala46, Tyr54, Cys148, Hisl 66, Glu169, and Gln192) are indicated by 
blue arrows. 


8. Scheme Legends 

Scheme 1. Synthesis of sub-structural components of 1A. 

9. Tables 

Table la. Class A peptidomimetic inhibitors. _ 
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Table 1b. Class B peptidomimetic inhibitors. 
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